
Conversation

@kuafou (Contributor) commented Oct 29, 2025

Description

This PR fixes a compatibility issue with recent vLLM changes that now require model classes to implement a get_input_embeddings() method.
Without this method, vLLM fails its interface validation during model registration, breaking TPU model integration.

To address this, we add a dummy get_input_embeddings() implementation to the vLLM-compatible wrapper class in
tpu_inference/models/common/model_loader.py.
Like the existing dummy forward() method, it exists only to satisfy vLLM’s type checks and raises
NotImplementedError if invoked, so the JAX model is never initialized during import or introspection.
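
For illustration, here is a minimal sketch of this pattern. The factory name make_vllm_wrapper_class and the unimplemented_forward helper are hypothetical names for this sketch; the PR itself only names unimplemented_get_input_embeddings().

```python
# Minimal sketch of the wrapper pattern, NOT the exact source of
# tpu_inference/models/common/model_loader.py; make_vllm_wrapper_class
# and unimplemented_forward are assumed names for illustration.
import torch.nn as nn


def unimplemented_forward(self, *args, **kwargs):
    # Existing dummy forward(): present only so vLLM's interface checks
    # pass; it is never meant to be called.
    raise NotImplementedError("This wrapper exists only for vLLM registration.")


def unimplemented_get_input_embeddings(self, *args, **kwargs):
    # New dummy get_input_embeddings(): satisfies vLLM's stricter
    # interface validation and raises if actually invoked.
    raise NotImplementedError("get_input_embeddings is not supported here.")


def make_vllm_wrapper_class(name: str) -> type:
    # Dynamically build a PyTorch wrapper class that vLLM's registry will
    # accept, without initializing any JAX state at import time.
    return type(
        name,
        (nn.Module,),
        {
            "forward": unimplemented_forward,
            "get_input_embeddings": unimplemented_get_input_embeddings,
        },
    )
```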

Why this change is needed

  • vLLM recently introduced a strict requirement for model classes to define get_input_embeddings() (link).
  • TPU inference uses a dummy PyTorch wrapper to register JAX models into vLLM’s registry.
  • Since this wrapper lacked get_input_embeddings, vLLM failed model registration checks.

Implementation details

  • Added an unimplemented_get_input_embeddings() dummy function to the wrapper type.
  • Registered it inside the dynamically created wrapper class.
  • Added a test, tests/test_vllm_wrapper.py (sketched after this list), to ensure:
    • The wrapper defines get_input_embeddings().
    • The method raises NotImplementedError.
    • The class passes is_vllm_model() validation.
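
A sketch of what those checks might look like, reusing the hypothetical make_vllm_wrapper_class factory from the sketch above (the actual test code is not shown in this PR):

```python
# Sketch of the checks in tests/test_vllm_wrapper.py, assuming the
# hypothetical make_vllm_wrapper_class factory from the earlier sketch
# is importable here.
import pytest


def test_wrapper_defines_get_input_embeddings():
    wrapper_cls = make_vllm_wrapper_class("DummyJaxModelWrapper")
    assert hasattr(wrapper_cls, "get_input_embeddings")


def test_get_input_embeddings_raises():
    wrapper_cls = make_vllm_wrapper_class("DummyJaxModelWrapper")
    with pytest.raises(NotImplementedError):
        wrapper_cls().get_input_embeddings()

# The third check, that the class passes is_vllm_model() validation,
# would call vLLM's own helper; its import path is not shown in this PR.
```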

Related Issue

Fixes: #951

Tests

pytest -v tests/models/common/test_model_loader.py 

Checklist

Before submitting this PR, please make sure:

  • I have performed a self-review of my code.
  • I have added necessary comments to my code, particularly in hard-to-understand areas.
  • I have made or will make corresponding changes to any relevant documentation.

@kuafou force-pushed the qi/fix-vllm-model-wrapper branch from 79d6f2d to 80ab177 on October 29, 2025 at 23:22
@py4 requested a review from @karan on October 30, 2025 at 19:30
@kuafou force-pushed the qi/fix-vllm-model-wrapper branch 2 times, most recently from 59e1a8e to bc8fe75 on November 5, 2025 at 18:29
@karan (Collaborator) left a comment

Thank you for the PR.

@kuafou (Contributor, Author) commented Nov 23, 2025

Hi @karan,
Looks like everything has passed and the PR is approved.
When you get a chance, could you help merge it? Thanks! 🙏

@kyuyeunk (Collaborator) commented

> Hi @karan,
> Looks like everything has passed and the PR is approved.
> When you get a chance, could you help merge it? Thanks! 🙏

Thank you for the contribution. I'll run the CI manually and will merge if it all passes.

@kyuyeunk (Collaborator) commented

Hmm, seems like the branch is outdated. Can you update it?

@kuafou force-pushed the qi/fix-vllm-model-wrapper branch from bc8fe75 to 85606aa on November 23, 2025 at 08:40
@kuafou requested a review from @vipannalla as a code owner on November 23, 2025 at 08:40
@kyuyeunk (Collaborator) commented

Running CI: https://buildkite.com/tpu-commons/tpu-inference-ci/builds/5856

@kuafou (Contributor, Author) commented Nov 23, 2025

> Hmm, seems like the branch is outdated. Can you update it?

Hi @kyuyeunk, I have updated this PR.

@kyuyeunk (Collaborator) commented

Except for the already-failing tests, I verified that all tests pass.

Merging the PR.

@kyuyeunk merged commit add0b5b into vllm-project:main on Nov 23, 2025
3 checks passed
Successfully merging this pull request may close issue #951: [Bug]: vllm model interface now requires get_input_embeddings